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Introduction 

My  laboratory  has  long  been  dedicated  to  the  application  of  genetic 
principles  to  both  the  understanding  of  cancer  biology  and  the  search  for  novel 
anticancer  targets.  Since  the  1970s,  the  war  on  cancer  has  been  based  on  the 
notion  that  studying  the  disease  will  lead  to  the  discovery  of  vulnerabilities,  which 
can  be  exploited  in  the  clinic.  While  many  underlying  genetic  determinants  of 
cancer  have  been  identified,  this  knowledge  has  failed  to  translate  into  new 
therapeutic  strategies,  with  only  a  handful  of  exceptions.  One  hypothesis  is  that 
this  failure  has  largely  been  due  to  the  genetically  intractable  nature  of  cultured 
mammalian  cells.  In  part  due  to  work  from  my  laboratory,  dsRNA-induced  gene 
silencing,  or  RNA  interference  (RNAi),  in  mammalian  systems  has  emerged  as  a 
tool  that  is  likely  to  re-invigorate  the  field  of  somatic  cell  genetics  and  in  the 
process  revolutionize  the  study  of  human  disease.  During  the  past  year,  we 
have  reached  a  point  at  which  any  gene  in  the  human  genome  can  conceivably 
be  targeted  using  small  dsRNA  gene  silencing  triggers  -  small  interfering  RNAs 
(siRNAs)  or  expressed  short  hairpin  RNAs  (shRNAs).  I  have  recently  received 
an  innovator  award  from  this  same  program  that  will  fund  the  construction  of 
shRNA-expression  cassettes  on  a  large,  and  possibly  whole-genome  scale.  We 
will  be  able  to  use  these  tools  within  the  present  programs  to  test  the  notion  that 
synthetic  lethality  is  a  plausible  approach  to  the  discovery  of  novel  anticancer 
targets.  I  note  that  the  goals  of  this  grant  and  that  of  the  Innovator  award  are 
distinct.  The  innovator  award  is  to  develop  high-throughput  procedures  to  create 
a  resource  for  the  entire  cancer  community.  This  program  will  apply  that 
resource  to  a  very  specific  problem  in  cancer  biology. 

Body 

A  Genome-wide  shRNA  library 

Over  the  past  year  and  with  the  support  of  an  Innovator  award,  we  have 
constructed  a  library  of  RNAi-inducing  constructs  that  covers  ~1 0,000  cancer 
relevant  genes  with  from  1  to  6  shRNAs  each.  This  library  is  the  tool  that  is 
necessary  to  move  fonward  with  large-scale  screens  for  synthetic  lethality  in 
mammalian  cells.  The  library  itself  is  being  constructed  with  support  from  a 
number  of  sources  (consider  that  the  overall  cost  will  be  -$9,000,000).  Since  it  is 
a  key  reagent  for  next  year’s  studies,  it  is  worthwhile  to  describe  the  resource 
here. 


The  overall  goal  was  to  construct  a  library  of  RNAi-inducing  constructs 
corresponding  to  each  gene  in  the  human  genome.  The  first  phase  of  the  project 
was  to  target  a  number  of  genes  that  are  relevant  to  cancer  and  to  make  the 
corresponding  constructs  available  to  the  research  community.  Specifically,  the 
National  Cancer  Institute  agreed  to  support  the  construction  of  a  total  of  9,000 
shRNA  expression  vectors  targeted  against  3,000  cancer  relevant  genes. 
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Additional  constructs  were  and  continue  to  be  made  with  support  from  the 
Innovator  award  and  with  commercial  funding. 

Gene  selection 

The  overall  philosophy  of  which  genes  to  target  was  discussed  extensively 
with  leading  investigators  in  the  cancer  research  community  and  with  the  NCI. 
Categories  of  genes  to  be  targeted  were  determined  and  responsibilities  for  the 
construction  of  gene  lists  were  apportioned.  We  followed  as  closely  as  possible 
the  guidelines  for  gene  choice  and  built  lists  of  cancer-relevant  genes  on  our  own 
and  in  consultation  with  other  experts  in  cancer  biology.  The  list  comprised 
several  functionally  chosen  groups,  such  as  kinases,  proteolysis  genes, 
phosphatases,  GPCRs  and  trafficked  proteins  that  are  considered  accessible  to 
medicinal  chemistry.  In  addition,  we  prioritized  genes  based  upon  cellular 
processes  such  as  DNA  repair,  cell-cycle  and  growth  control.  Finally,  we  added 
lists  of  genes,  which  had  been  deemed  relevant  to  cancer  in  a  number  of  ways. 
For  example,  a  group  of  investigators,  including  Joan  Brugge,  Ed  Harlow  and 
Josh  LeBaer  at  Harvard  have  hand-collated  1000  genes  which  are  implicated  in 
breast  cancer.  We  have  also  included  genes  based  upon  collated  microarray 
data  on  breast  and  lung  cancer.  Combining  all  of  these  sources  of  information 
led  to  a  list  of  -9,300  high  priority  genes.  It  is  our  hope  that  the  NCI  will  find  this 
list  satisfactory,  but  we  are  certainly  willing  to  add  to  the  list  any  genes  that  they 
feel  might  contribute  to  the  utility  of  our  overall  resource. 

shRNA  optimization 

We  tested  carefully  a  large  number  of  different  strategies  to  achieve  an 
optimal  configuration  for  shRNA  expression  and  structure.  This  ultimately 
translated  into  a  29  nucleotide  hairpin  with  a  simple  4  base  loop.  This  is 
expressed  from  the  U6  snRNA  promoter  and  is  preceded  by  the  U6  leader 
sequence.  We  continue  to  update  our  strategy  and  have  just  begun  tiling 
experiments  with  2  genes  to  attempt  to  derive  rules  for  choosing  optimal 
shRNAs. 

Vector  construction  and  validation 

In  collaboration  with  Steve  Elledge,  we  constructed  a  flexible  vector 
system  for  harboring  the  shRNA  library  (Figure  1).  We  have  demonstrated  that 
this  vector  can  transfer  shRNA  inserts  to  a  recipient  plasmid  by  bacterial  mating 
with  -1 00%  efficiency.  We  have  also  validated  transfers  in  multi-well  formats 
suitable  for  moving  subsets  of  or  even  the  entire  library. 
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Figure  1 .  shRNA  library  vector 


shRNA  library  construction 

We  have  purchased  ~40,000  oligonucleotides  corresponding  to  the 
~1 0,000  targeted  genes.  Initial  purchases  were  from  lllumina.  The  performance 
of  the  first  10,000  oligos  was  satisfactory  but  certainly  less  than  optimal.  The 
performance  of  the  second  10,000  oligos  was  horrendous.  Fully  30%  of  the  96 
well  plates  that  were  delivered  produced  complete  failures  in  our  cloning 
procedure.  The  performance  on  the  remainder  was  also  abysmal,  lllumina  could 
provide  no  quality  control  data  to  assure  us  that  the  material  that  they  had 
shipped  were  even  oligonucleotides.  This  caused  a  significant  delay  and  waste 
of  resources,  from  which  we  are  still  recovering,  lllumina  did  provide  a  refund  of 
~$1 00,000,  in  consideration  for  this  failure.  However,  this  was  significantly  less 
than  the  cost  of  demonstrating  the  low  quality  of  their  oligos.  We  have  now 
changed  suppliers  to  Sigma-Genosys  and  see  much  better  and  more  consistent 
performance. 

To  date,  we  have  sequence  verified  ~25,000  individual  shRNAs.  Overall 
library  statistics  as  of  1 1/10/03  are  shown  below.  These  have  been  compressed 
onto  ~200  96-well  plates,  and  approximately  1 0,000  of  the  clones  have  been 
verified  for  release  to  the  scientific  community  upon  publication  of  the  manuscript 
(see  Figure  2). 
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Full  Libraiy 


Figure  2.  Progress  toward  library  construction 

Each  of  the  verified  shRNAs  is  associated  with  a  ~60  nucleotide  barcode. 
Sequence  analysis  suggests  that  these  are  >95%  unique.  We  have  not  yet 
decided  how  to  handle  the  5%  which  are  associated  with  more  than  one  shRNA, 
but  given  the  high  probability  that  the  library  will  be  most  often  used  in  restricted 
pools,  we  don’t  feel  an  immediate  need  to  replace  those  shRNAs  with  redundant 
barcodes  until  the  first  phase  of  the  project  is  complete. 

Given  steady  progress,  we  anticipate  finishing  this  set  of  genes  within  the 
next  couple  of  months. 

Working  with  Rosetta  Inpharmatics,  we  have  validated  in  pilot  experiments 
the  use  of  in  situ  synthesized  oligonucleotides  for  library  construction.  Within  the 
next  several  months,  we  should  move  into  the  final  phase  of  deciding  whether 
this  method  of  library  construction  is  suitable  for  large  scale  use.  This  agreement 
does  not  restrict  distribution  of  clones  derived  from  Rosetta  oligos  in  anyway. 

Library  Distribution 

Open  Biosystems  has  agreed  to  distribute  the  library  and  individual  clones 
from  the  library  at  a  relatively  low  cost  to  non-commercial  researchers. 
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Loss  of  function  genetics 

The  original  goal  of  this  grant  was  to  use  genetic  technologies  to  enable 
loss-of-fu notion  approaches  toward  finding  new  anti-cancer  targets  in  mammalian 
cells.  The  discovery  that  RNAi  could  be  used  as  such  a  tool  has  been  a 
tremendous  boon.  The  availability  of  a  large-scale  RNAi-based  library  now 
makes  our  original  goals  possible. 

We  are  taking  two  approaches  toward  identifying  synthetic  lethal 
interactions.  The  first  is  to  use  very  standard  approaches,  using  either 
morphological  markers  of  cell  death  or  fluorescent  reports  of  caspase  activity  in 
vitro  to  identify  lethal  interactions.  The  second  is  to  use  highly-parallel 
approaches  based  upon  the  use  of  barcode  arrays  to  identify  such  interactions. 

One-by-one  approaches 

To  validate  the  use  of  the  library  in  well-by-well  screens,  we  began  by 
examining  the  ability  of  shRNAs  within  the  collection  to  provoke  a  well 
established  phenotype.  By  knowing  the  answer  beforehand,  we  might  get  a 
good  appreciation  for  how  well  the  resource  would  work  in  a  less  characterized 
context. 

We  chose  to  examine  7,800  shRNA  constructs  for  the  ability  to  inhibit 
proteosome-mediated  degradation  of  a  direct  proteosome  target 
(ZsGreen Prosensor,  Clontech).  This  contains  the  ODC  degron. 

Cells  were  co-transfected  with  the  construct  and  individual  shRNA 
expression  vectors  that  had  been  prepared  robotically.  For  normalization,  a 
dsRED  expression  vector  was  also  included.  Fluorescent  signals  were  read  on  a 
Victor  11  plate  reader,  and  wells  were  normalized  based  upon  the  dsRED 
transfection  marker. 

A  visual  output  of  the  data  is  shown  in  Figure  3.  We  saw  a  significant 
enrichment  for  shRNAs  that  target  the  proteosome  in  this  screen.  Secondary 
tests  revealed  that  roughly  22/55  proteosome  targeted  shRNAs  scored  positive  in 
our  assay,  consistent  with  our  expectation  that  about  1/3  of  the  shRNAs  in  the 
library  would  be  effective. 
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Genome  wide  analysis  of  the  proteasome-mediated 
proteolysis  pathway 


♦  Library  shRNA 


A  Positive  shRHA  targeting 
^  proteasome  subunit 

Cut  off 


library  shRHA 


Positive  shRHA  targeting 
proteasome  subunit 

Cut  off 


Figure  3.  A  screen  of  7,800  shRNAs  for  proteosome  inhibitors 

We  are  now  ready  to  use  these  same  validated  DNA  preparations  in  96- 
well  plate-based  approaches  for  synthetic  lethality  screens. 

Highly  parallel  approaches 

Since  individual  vectors  contain  unique  barcodes  that  have  been 
associated  with  each  shRNA  by  sequence  analysis,  we  can  use  a  population- 
based  approach  to  identify  genes  that  interact  with  specific  pathways.  In  short,  a 
population  of  cells  is  infected  with  the  mixed  library  under  conditions  in  which 
each  construct  is  represented  by  multiple  individual  infected  cells  (-1000 
cells/construct).  This  population  is  subjected  to  a  stress  (genetic,  environmental 
or  chemical)  while  a  control  population  remains  untreated.  The  barcodes 
representing  individual  vectors  are  then  recovered  from  the  populations  by  PCR. 
The  frequency  change  in  each  clone  within  the  population  can  then  be  measured 
by  competitive  hybridization  of  the  barcode  material  to  DNA  microarrays.  For 
those  shRNAs  that  confer  resistance  to  a  treatment,  we  expect  enrichment. 
Conversely,  for  those  barcodes  that  sensitize  a  cell  to  a  particular  treatment,  we 
expect  a  reduction  in  frequency. 

The  overall  approach  is  depicted  schematically  in  Figure  4. 
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Highly  parallel  phenotypic  screens  using  RNAi  libraries 


Figure  4  Highly  parallel  approaches  toward  synthetic  lethal  phenotypes. 

Thus  far,  we  have  examined  and  validated  the  underlying  technology.  As 
an  example,  we  infected  normal  human  diploid  fibroblasts  (IMR90)  with  the 
library  under  conditions  where  the  MOI  was  less  than  1/cell.  After  48  hours  of 
infection,  DMA  was  harvested  and  two  independent  populations  were  compared 
by  microarray  analysis  to  examine  the  specificity  and  reproducibility  of  the  assay. 
The  data  is  presented  as  Figure  5. 
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Figure  5  Validation  of  the  barcode  strategy 

Overall,  there  is  a  very  high  degree  of  correlation  between  the  two 
samples.  Furthermore,  the  of  the  several  thousand  negative  controls  on  the  chip, 
all  but  a  few  show  no  detectable  hybridization.  We  are  now  performing  time- 
course  experiments  to  determine  the  contribution  from  genetic  drift  in  populations 
so  that  we  might  set  appropriate  parameters  for  synthetic  lethality  screens. 

Key  Research  Accomplishments 

•  Preparation  of  7,800  DNAs  corresponding  to  shRNA  expression  vectors 
from  the  library 

•  Validation  of  the  library  in  pilot  genetic  screen 

•  Design  of  microarrays  to  monitor  shRNA  populations 

•  Validation  of  the  barcoding  strategy  for  genetic  screens 

Reportable  outcomes 
None 
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Conclusions 

We  now  have  in  place  the  tools  for  large-scale  synthetic  lethality  screens 
in  mammalian  cells. 
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